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Abstract. Consider random symmetric Toeplitz matrices T n = (dj_ j^j—i 
with matrix entries aj,j = 0, 1, 2, • • ■ , being independent real random variables 
such that 



E[aj] = 0, E[|a 3 | 2 ] = l for j = 0,1,2, • 



2 

(homogeneity of 4-th moments) 

K = E[|Oj| 4 ], 
and further (uniform boundedness) 

supE[|aj| fc ] = C k < oo for k > 3. 

j>0 

Under the assumption of ao = 0, we prove a central limit theorem for linear 
statistics of eigenvalues for a fixed polynomial with degree > 2. Without the 
assumption, the CLT can be easily modified to a possibly non-normal limit 
law. In a special case where a,j 's are Gaussian, the result has been obtained by 
Chatterjee for some test functions. Our derivation is based on a simple trace 
formula for Toeplitz matrices and fine combinatorial analysis. Our method 
can apply to other related random matrix models, including Hankel matrices 
and product of several Toeplitz matrices in a flavor of free probability theory 
etc. Since Toeplitz matrices are quite different from the Wigner and Wishart 
matrices, our results enrich this topic. 
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1. Introduction and main results 

Toeplitz matrices emerge in many aspects of mathematics and physics and also 
in plenty of applications, see Grenander and Szego's book [T7J for a detailed intro- 
duction to deterministic Toeplitz matrices. The study of random Toeplitz matrices 
with independent entries is proposed by Bai in his review paper [3] . Since then, the 
literature around the asymptotic distribution of eigenvalues for random Toeplitz 
and related matrices is very large, including the papers of Basak and Bose [6 , Bose 
et al. 011], Bryc et al. [TO], Hammond and Miller [IS], Kargin 23 j, Liu and Wang 
[35], Massey et al. [37], etc. We refer to [5] for recent progress. However, the study 
of fluctuations of eigenvalues for random Toeplitz matrices is quite little, to the 
best of our knowledge, the only known result comes from Chatterjee in the 
special case where the matrix entries are Gaussian distributions. In this paper we 
will derive a central limit theorem (CLT for short) for linear statistics of eigenvalues 
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of random Toeplitz and related matrices, for some of which the asymptotic distri- 
butions of eigenvalues are also new, including sparse Toeplitz and Hankel matrices 
and singular values of powers of Toeplitz matrices. 

In the literature fluctuations of eigenvalues for random matrices have been ex- 
tensively studied. The investigation of central limit theorems for linear statistics of 
eigenvalues of random matrices dates back to the work of Jonsson [22] on Gaussian 
Wishart matrices. Similar work for the Wigner matrices was obtained by Sinai and 
Soshnikov [3UJ . For further discussion on Wigner (band) matrices and Wishart ma- 
trices and their generalized models, we refer to Bai and Silverstein's book 0], recent 
papers [2j 111) and the references therein. For another class of invariant random ma- 
trix ensembles, Johansson |21] proved a general result which implies CLT for linear 
statistics of eigenvalues. Recently, Dumitriu and Edelman [15] and Popescu [29] 
proved that CLT holds for tridiagonal random matrix models. 

Another important contribution is the work of Diaconis et al. [HJ |T3], who 
proved similar results for random unitary matrices. These results are closely con- 
nected to Szego's limit theorem (see 20 ) for the determinant of Toeplitz matrices 
with (j,k) entry g{j — k), where g{k) — ^L J Q n g(e l6 )e~' lke d9, see [HI [7] and refer- 
ences therein for connections between random matrices and Toeplitz determinants. 
Here we emphasize that Szego's limit theorem implies a CLT. 

Now we turn to our model. The matrix of the form T n = (ai-j)fj =1 is called a 
Toeplitz matrix. If we introduce the Toeplitz or Jordan matrices B — (<5i+i,j)™j = i 
and F — (Sij + i)^j =1 , respectively called the "backward shift" and "forward shift" 
because of their effect on the elements of the standard basis {ei, • • • ,e„} of K™, 
then an n x n matrix T can be written in the form 

n— 1 

(1.1) T X" / '" ' f ' 

3=0 J=l 

if and only if T is a Toeplitz matrix where a- n +i, • • ■ ,ao, • • • ,a n -i are complex 
numbers [24j . It is worth emphasizing that this representation of a Toeplitz matrix 
is of vital importance as the starting point of our method. The "shift" matrices B 
and F exactly present the information of the traces. 

Consider a Toeplitz band matrix as follows. Given a band width b n < n, let 



(1-2) Vij 



otherwise. 



Then a Toeplitz band matrix is 

(1.3) T n = (rjij ai_,-)£j=i- 

Moreover, the Toeplitz band matrix T n can also be rewritten in the form 



bn b n b n 

(1.4) T n = J2 a -j BJ + £ ajF* = a I n + £ {a- S B> + ajF j ) , 

3=0 3 = 1 3 = 1 

where /„ is the identity matrix. Obviously, a Toeplitz matrix can be considered 
as a band matrix with the bandwidth b n = n — 1. In this paper, the basic model 
under consideration consists of n x n random symmetric Toeplitz band matrices 
T n = (rjij a,i~j)fj =1 in Eq. (jl.3|) . We assume that aj — a_j for j — 1, 2, • • • , and 
{oj l^i is a sequence of independent real random variables such that 
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(1.5) E[ aj ]=0, E[| aj | 2 ] = l for J = 1, 2, ■ ■ - , 
(homogeneity of 4-th moments) 

(1.6) K = n\aj\% 
and further (uniform boundedness) 

(1.7) supE[|a,-|*] = C*<oo for k > 3. 

In addition, we also assume clq = (we will explain in Remarks II .31 and 15.51 below!) 
and the bandwidth b n — > oo but b n /n — > b € [0, 1] as n — > oo. 

Set A n = -^=, a linear statistic of eigenvalues A l7 • • • , A„ of A n is a function of 
the form 

1 - 

(1-8) r£/(*j). 

where / is some fixed function. In particular, when f(x) — x p we write 

r~ n 

On \ - 



(1.9) "P = ^rY,W-m\) 

These ui p , p = 2,3,..., are our main objects. Note that i Y^j=i (^j — ^[^]) 
converges weakly to zero as n — > oo, moreover under the condition 

it < °° 

3=1 " 

we have a strong convergence, see [23J [26] . We remark that the fluctuations 

n 1 71 

(i-io) =-E^--E E ^]- 

are of order -4= while those for Wiener matrices are of order 1- , more like the case of 
classical central limit theorem. This shows that the correlations between eigenvalues 
for Toeplitz matrices are much weaker than those for Wigner matrices (another 
different phenomenon is that the limiting distribution for random Toeplitz matrices 
has unbounded support, see [101 1 18 ). The potential reasons for this phenomenon 
may come from the fact that the order of the number of independent variables is 
0(n) for Toeplitz matrices while it is 0(n 2 ) for Wigner matrices. On the other 
hand, the eigenvalues of random Toeplitz matrices are obviously not independent, 
so it is different from the case of CLT for independent variables. 

In special case that the matrix entries aj are Gaussian distributions, by using his 
notion of 'second order Poincare inequalities' Chatterjee in [IT] proved the following 
theorem: 

Theorem f [11|. Theorem 4.5) Consider the Gaussian Toeplitz matrices T n = 
((H-j^j—i) i- e - o-j — a -j for j = 1, 2, • • • , and {a,j}j°. is a sequence of independent 
standard Gaussian random variables. Let p n be a sequence of positive integers such 
that p n = o(logn/ log log n). Let A n — T n /y/n, then, as n — >• 00, 

tr(i4£.) - E[tr(^»)] 



v/Var(tr(^")) 



converges in total variation to N(0, 1). 
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The CLT also holds for tr(f(A n )), when f is a fixed nonzero polynomial with non- 
negative coefficients. 

The author remarked that the above theorem is only for Gaussian Toeplitz matri- 
ces based on the obvious fact: considering the function f{x) = x, CLT may not hold 
for linear statistics of non-Gaussian Toeplitz matrices. The author also remarked 
that the above theorem says nothing about the limiting formula of the variance 
Var (tr( J 4^" )). However, we assert that CLT holds for a test function f(x) = x 2p 
even for non-Gaussian Toeplitz matrices. When f(x) = x 2p+1 the fluctuation is 
Gaussian if and only if the diagonal random variable ao is Gaussian. Moreover, 
if we suppose ao = 0, we can obtain CLT for any fixed polynomial test functions. 
On the other hand, we can calculate the variance in terms of integrals associated 
with pair partitions. Unfortunately, our method fails to deal with the test function 
f(x) — where p n depends on n. 

Our study is inspired by the work of Sinai and Soshnikov |30j . but new ideas 
are needed since the structure of Toeplitz matrices is quite different from that of 
Wigner matrices. In addition, our method can apply to other related random matrix 
models, including Hermitian Toeplitz matrices, Hankel matrices, sparse Toeplitz 
and Hankel matrices, singular values of powers of Toeplitz matrices, and product 
of several matrices in a flavor of free probability theory. 

Now we state the main theorem of our paper as follows. 

Theorem 1.1. LetT n be real symmetric ( hl.5\) - {l.T^ ) random Toeplitz band matrix 
with the bandwidth b n , where b n /n — > b G [0,1] but b n — > oo as n — > oo. Set 
A n = Tnl^fbn and 

(1.11) u . 

For every p > 2, we have 
(1.12) 

in distribution as n — » oo. Moreover, for a given polynomial 

v 

(1.13) Q(x)=Y J ^ 

3=2 

with degree p > 2, set 

(1.14) = — (trQ(A n ) - E[trQ(A.)]) , 

n 

we also have 

(1.15) w —►#(<), o£) 

in distribution as n — > oo . Here the variances a 2 and Oq will be given in section [^J 

From the proof of our main theorem, we can easily derive an interesting re- 
sult concerning product of independent variables whose subscripts satisfy certain 
"balance" condition. When p = 2, it is a direct result from the classical central 
limit theorem. Here we state it but omit its proof, see Remark 15.61 for detailed 
explanation. 



= ^™ (tr( A^)-E[tr(^)]). 
u p —>N{0,o*) 
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Corollary 1.2. Suppose that a, = a-j for j = 1, 2, • • • , and {aj} < jL 1 is a sequence 
of independent random variables satisfying the assumptions (|1.5[) - (11.7[) . For every 
P>2, 

-, n / p p \ 

(1.16) — £ II a i« " E (Q a *] K * ■ 

converges in distribution to a Gaussian distribution N(0, a p ). Here the variance a p 
as in section S] corresponding to the case 6 = 0. 

Remark 1.3 (on the diagonal entry a ). Rewrite A n (a a ) = A n {Q) + -j£=I n , where 
A n (0) denotes the matrix with ao = 0. It is easy to see that 

(1.17) u p (a Q ) = wp(0) + p tl{APn 1(0)) (a - E[a ]) + O^ 72 ), 

n 

which converges in distribution to the distribution of a p n + pM p _i(«o — E[ao]). 
Here the n is the standard normal distribution, independent of ao, and M p _i is the 
(p-l)-th moment in Theorem 13.11 Since M p = p is odd, if ao is neither a 
constant a.s. nor Gaussian then the fluctuation is not Gaussian for odd p. 

The remaining part of the paper is arranged as follows. Some integrals associated 
with pair partitions are defined in section O Sections 2] and [5] are devoted to 
the proof of Theorem 11.11 We will extend our main result to other models closely 
connected with Toeplitz matrices in section [5] 



2. Integrals associated with pair partitions 

In order to calculate the moments of the limit distribution and the limiting 
covariance matrix of random variables uj p , we first review some basic combinatorical 
concepts, and then define some integrals associated with pair partitions. 

Definition 2.1. Let the set [n] — {1, 2, • • • , n}. 

(1) We call 7r = {Vi, • • • , V r } a partition of [n] if the blocks Vj (1 < j < r) are 
pairwise disjoint, non-empty subsets of [n] such that [n] — VxD- ■ -yjV r . The number 
of blocks of 7r is denoted by and the number of elements of Vj is denoted by 
1^1- 

(2) Without loss of generality, we assume that V%, ■ ■ ■ ,V r have been arranged 
such that si < S2 < ■ ■ ■ < s r , where Sj is the smallest number of Vj. Therefore we 
can define the projection ir(i) = j if i belongs to the block Vj; furthermore for two 
elements p, q of [n] we write p ~ T q if ir(p) = ir(q). 

(3) The set of all partitions of [n] is denoted by V(n), and the subset consisting 
of all pair partitions, i.e. all \Vj\ = 2, 1 < j < r, is denoted by Viiri). Note that 
T , 2{n) is an empty set if n is odd. 

(4) Suppose p, q are positive integers and p + q is even, we denote a subset 
of Vi(p + q) by V2{p,q), which consists of such pair partitions n: there exists 
l<i<p<j<p + q such that i j (we say that there is one crossing match in 
tt). 

(5) When p and q are both even, we denote a subset of V(p + q) by V2,4,(p, <?), 
which consists of such partitions n = {V\, ■ ■ ■ ,V r } satisfying 

(i) l^'l — 2, 1 < j < r, j 7^ i and \V,\ =4 for some i. 

(ii) Vj C {l,2,...,p}or{p+l,p + 2,...,p + q} for l<j<r,j^i. 
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(iii) two elements of V% come from {1,2, ... ,p} and the other two come from 
{p+l,p + 2,...,p + q}. 

For other cases of p and q, we assume T > 2,a(p,q) is an empty set. 

Now we define several types of definitive integrals associated with n G Vi{p,<l) 
or 7r G 7^2,4 (j>, <?)• For the reader's convenience, we suggest to omit them for the 
moment and refer to them when they are needed in sections [3] and 01 Let the 
parameter b 6 [0, 1]. 

First, for ir G V-z(j>,q) we set 



(2.1) 6)r (i) 



1, i is the smallest number of 7r 
— 1, otherwise. 



To every pair partition 7r G Vi(j>, <?), we construct a projective relation between two 
groups of unknowns y\, . . . , y p + q and £C \ ..... X p-\-g £IS follows: 

(2-2) e ff (i) j/j = e„(J) yj = x„ {i) 

whenever i ~ T j. Thus, we have an identical equation 

p+q 

(2.3) E^°- 

3=1 

For xo,yo G [0,1] and xi , . . . ,x P + g G [—1,1], we define two kinds of integrals 
with Type I by 

/f w = 

(2.4) 

p \ p 3 p+q 3 ~~ 2~~ 

y^Vi \\\x[o,i]{xo + by^yi) Yl X[o,i](yo + b yi)dy T[d 

i=i / j=i i=i j'=p+i ;=o 

and 

tfW = 

(2.5) 

P \ P J P+9 J ~~ 2~~ 

y^J/i )Y[x[q,i]{xq + b^2yi) Yl X[o,i](vo -b ^2 yi)dyoY[d 

i=l J j=l i=l j'=p+l i=0 

Here <5 is the Dirac function and \ is the indicator function. Note that Ysj=i Vj 7^ 
by the definition of p2(p,q), therefore the above integrals are multiple integrals in 
(£±2 + 1) variables. 

Next, for it — {Vi, . . . , V P +g 1 } G P2,4(p,q) (denoting the block with four ele- 
ments by Vi ), we set for ir(k) ^ i 



(2.6) r n (k) 
while for ir{k) = i 

(2.7) T n (k) = 



1, k is the smallest number of tt 1 (7r(fc)); 
— 1, otherwise 



1, k is the smallest or largest number of n 1 (n(k)); 
— 1 , otherwise. 
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To every partition 7r G V2a{p, q), we construct a projective relation between two 
groups of unknowns y\ , . . . , y P + q and X 1 j . . . , X p+q ^ 

follows: 

(2.8) r n (i) yi = r w (j) y 2 = x„(i) 

whenever i ~ v j. Then two kinds of integrals with Type II are defined respectively 

by 

(2.9) 

i ■/ p+q i 

P v j p+q 3 — — 1 

f7i( n )= p+, Il^l ' 1 ]^ + b ^2vt) II X[o,i](yo + b V yt)dy Jl d Xl 

i[o,i]^xhi,i]V i i=1 . =1 f=p+1 . =p+1 /=0 

and 

p j p+q j ~~ 2 — 1 

fn( 7T )= p+, n*. 1 ]^ 1 ^^ IT X[o,i](j/o -6 J/»)dj/o TT dxi. 

J[0,l] 2 x[-l,l] — S ^ p+1 f=0 
3. MATHEMATICAL EXPECTATION 

In this section, we will review some results about the moments of the limiting 
distribution of eigenvalues in [26], for the convenience of the readers and further 
discussion. 

Theorem 3.1. E[Hi(Al k )} = M 2k + o(l) and E[Hr(Al k+1 )} = o(l) as n — > oo 
where 

n 2 k J k 

(3.1) M 2k = / n^i"- 1 ]^ + 6 X! e7r ^) a;7r (' i ))n da; '- 

Let us first give a lemma about traces of Toeplitz band matrices. Although its 
proof is simple, it is very useful in treating random matrix models closely related 
to Toeplitz matrices. 

Lemma 3.2. For Toeplitz band matrices Ti >n = (rjij &i,-t-j)"j- = i with the bandwidth 
b n where a;,_ n +i, • • • , a;, n _i are complex numbers and I — \....,p, we have the 
trace formula 

n 

(3.2) tr(Ti, n ---T P)ri ) = V Vajlj 6 p , p G N. 

i=i / "• >: ' / 
#ere J = (ji,...,j p ) € {-&„, . . . , 6„} p , a,/ = nf=i a ij^ = Uk=i X[i,n](* + 

j Ji) afirf £/ie summation ^ j runs over possibilities that J £ {— 6 n , . . . , 
Proof. For the standard basis {ei, • • • , e„} of the Euclidean space R™, we have 

fc„ b„ fc„ 

J=0 3=1 j=-b n 

Repeating T^ n 's effect on the basis, we have 

b„ p p k 

T hn ■ ■ ■ T p>n ei = V TT a iJi TT^[i^K i + $Z^) e . 

n 

By tr(T 1)n • • • T Pi „) = e* T 1>n • • • T Pi „ e,, we complete the proof. □ 

i=l 
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We will mainly use the above trace formula in the case where Ti <n = ■ ■ ■ = T Pi „ = 
T n . Since ao = 0, from Kronecker delta symbol in the trace formula of (|3.2|) . it 

suffices to consider these J = (ji,. ■■ ,j p ) £ {±1, • ■ • ,±^n} p with the addition of 

p 

2 jk — 0. We remark that ao = is not necessary to Theorem 13.11 In fact it is 

k=l 

sufficient to ensure Theorem 13.11 if all finite moments of random variable ao exist 
and its expectation is zero. 

Definition 3.3. Let J = (ji, ■ ■ ■ ,j p ) £ {±1, . . . , ±& n } p , we say J is balanced if 

p 

jk = 0- The component j u of J is said to be coincident with j v if \j u \ = for 

k=l 

I < u =/= v < p. 

For J £ {±1, • ■ • , ±6„} p , we construct a set of numbers with multiplicities 

(3.3) Sj = {\ji\,...,\j P \}. 

We call Sj the projection of J. 

The balanced J's can be classified into three categories. 

Category 1 (denoted by Ti(p)): J is said to belong to category 1 if each of its 
components is coincident with exactly one other component of the opposite sign. 
It is obvious that Ti(p) is an empty set when p is odd. 

Category 2 (^(p)) consists of all those vectors such that Sj has at least one 
number with multiplicity 1. 

Category 3 (^(p)) consists of all other balanced vectors in {±1, . . . , ±b n } p . For 
J £ T^ip), either Sj has one number of at least 3 multiplicity, or each of Sj has 
multiplicity 2 but at least two of the components are the same, which are denoted 
respectively by T^iij)) and T^ip)- 

We are now ready to prove Theorem 13. II 



Proof of Theorem \3.1{ By Lemma 13. 2\ we have 
1 1 n 

(3.4, n-^Af )i = ^ E 2>m u s a ? 4 = E, + E 2 + 

where 



=1 j °>£^ 



1 

(3.5) E^^E E a ^> z = 1 ' 2 ' 3 - 

" t=l J<£Ti(2k) 

By the definition of the categories and the assumptions on the entries of the 
random matrices, we obtain 

E = »- 

Next, we divide into two parts 

E =E +E . 

^-^3 ^31 ^—'32 

where 

1 n 

(^) E 3i = ^E E ^ ' = m. 

" »=1 Jer 3i (2fc) 

For J G r 3 (2fc), we denote the number of distinct elements of Sj by t. By the 
definition of the category, we have t < k. Note that the random variables whose 
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subscripts have different absolute values are independent. Once we have specified 
the distinct numbers of Sj, the subscripts ji, • • • , j2k are determined in at most 
2 2k k 2k ways. If J € r 3 i(2fc), then t < ^f^-- Again by independence and the 
assumptions on the matrix elements (|1.7[) . we find 



When J e r 32 (2fc), there exist po, qo € [2k] such that 

^ 2k 

jpo — 3qo — ^(jpo + Jio ~ / ji;)- 

-3=1 

We can choose the other at most k — 1 distinct numbers, which determine j Po = 
j 9o . This shows that there is a loss of at least one degree of freedom, thus the 
contribution of such terms is 0(n _1 ), i.e. 

i£ ffl i = °(i). 

Since the main contribution comes from the category 1, each term K[aj] = 1 for 
J G Ti(2fc). So we can rewrite 
(3.7) 

n b 2k I 

E[hr(Af)] = (1) + -L £ £ £ n^nlCi + E^-w)- 

" 7rG-P 2 (2fe) »=1 ii,- ,jk=-bn 1 = 1 9=1 

For fixed tt e 7> 2 (2fc), 

_^ n fc„ 2fc i 

^e e n^M^+E^^w)' 

™ i=l ,3k=-b„ 1=1 9=1 



i.e. 

n b n 2k I , 

™ i=i h,-,n=-i>ni=i 9=1 
can be considered as a Riemann sum of the definite integral 



2k l k 



/ Y[l[o,i](xo + b^2e 7 ,(q)xi r (q))Y[dx l . 

■W]x[-l.l]*i = i 9 =i ; =0 

As in the above arguments, by Lemma 13.21 and the assumptions on the matrix 
elements (| 1 . T[) . we have 



(3.8) 



E[itr(Af +1 )] 



n 

nb n 2 i=i 



since V 2 (2k - 1) = 0. 

This completes the proof. □ 

Remark 3.4. When b — 0, we can easily get M 2 fc = 2 k (2k — 1)11. This is just 
the 2k- moment of the normal distribution with variance 2, which is also obtained 
independently by Basak and Bose |6. and Kargin [53]. However, for b > it is quite 
difficult to calculate M 2 fc because the integrals in the sum of (|3.1[) are not all the 
same for different partitions 7r's, some of which are too hard to evaluate. 



10 



D.-Z. LIU X. SUN AND Z.-D. WANG 



4. COVARIANCE 

In this section we evaluate the covariance of ui p and uj q . Recall 



(4.1) 



rcr - n 



(tr«) - E[tr«)]) = — ^ V V J j (aj - E[aj]) 5 , 



n n&„ 2 i=i J 

= — 4izr£ E 1 ^'^-^)^ - em)] 

n 2 &« 2 i,t' J,J' 

(4-2) = 4iTT E E J -> ^' ( E [«^'] - n^jMaj'}) , 

n 2 b n 2 i,i> J,J' 

where the summation ^2 runs over all balanced vectors J G {±1, • • • ,±b n } p and 
J, J' 

J' e{±l,...,±b n }". 

The main result of this section can be stated as follows: 

Theorem 4.1. Using the notations in section^ for fixed p, q > 2, as n — > oo 

we have 

(4-3) _^ _^ 

E[wp Wg ]— ^«t m = (/fM+tfw) + («-i) E (/jiW+/iw) 

when p + q is even and 

(4.4) E[w p w g ]=o(l) 
w/iera p + q is odd. 

When p = q we denote cr^g by cr 2 , where cr p denotes the standard deviation. 
From the above theorem, we can obtain the variance of loq in Theorem 11.11 

p p 

(4.5) ^EEW]^' 

By the independence of matrix entries, the only non-zero terms in the sum of 
(|4.2p come from pairs of balanced vectors J = ■ ■ ■ ,j p ) and J' — . . . ,j' q ) 
such that 

(i) The projections Sj and Sji of J and J' have at least one element in common; 

(ii) Each number in the union of Sj and Sji occurs at least two times. 

Definition 4.2. Any pair of balanced vectors J = (ji, . . . ,j p ) and J' — . . . ,j' q ) 
satisfying (i) is called correlated. If j u € J(j u is a component of J) and \j u \ € 
Sj p| Sji, then j u is called a joint point of the ordered correlated pair. 

To observe which correlated pairs lead to the main contribution to the covariance, 
we next construct a balanced vector of dimension (p + q — 2) from each correlated 
pair J of dimension p and J' of dimension q. Although the corresponding map 
of correlated pairs to such balanced vectors is not one to one, the number of pre- 
images for a balanced vector of dimension {p + q — 2) is finite (only depending on p 
and q). We will study the resulting balanced vectors in a similar way in section [3j 



thus 
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Proof of Theorem \4-l\ Let us first construct a map from the ordered correlated 
pair J and J' as follows. Let j u £ J be the first joint point(whose subscript is the 
smallest) of the ordered correlated pair J and J', and let j' v be the first element 
in J' such that \j u \ = \j' v \. If j u = —j' v , we construct a vector L = (Zi, . . . , lp+q-2) 
such that 



^1 jl 7 ■ ■ • Ju—l — ju—1 ; ^U 

7 

<-u+v— 1 — Jti+1 ) ' 

It is obvious that 



• j lu+q 



1 lu+v — 2 — j v — Ij 

-2 = jg, lu+q-1 = ju+1, ■ ■ 



lp+q-2 — Jp- 



p+q-2 

E 

fc=i 



^ = 0, 



so L is balanced. If j u = j' v , then from J and — J' = 



-j') we proceed as 



in the way above. We call this process of constructing L from J and J' a reduction 
step and denote it by L — J\J 'uiJ' ■ 

Remark 4.3. From the construction above, for any joint point of J and J' a reduc- 
tion step can be done in the same way. Given 9 e Sj f] Sj>, when saying J\J g J', 
we mean that there exists certain joint point of J and J' j u satisfying \j u \ = 9 and 
J\J e J' is the vector after this reduction step. In this section, j u is always the first 
joint point. While in section [5j j u may denote other joint points which is clear in 
the context. 

Notice that the reduction might cause the appearance of one number with mul- 
tiplicity 1 in Sl-, although each number in the union of Sj and Sj> occurs at least 
two times. If so, the resulting number with multiplicity 1 in Sl must be coincident 
with the joint point j u . In addition, to estimate which terms lead to main contri- 
bution to higher moments of tr(^), we will use the reduction steps and mark the 
appearance of the numbers with multiplicity 1 in section [5] 

Next, we assume we have a balanced vector L of dimension (p + q — 2), we shall 
estimate in how many different ways it can be obtained from correlated pairs of 
dimensions p and q. First, we have to choose some component /„ in the first half 
of the vector, 1 < u < p such that 



(4.6) 

Set J = (ji 
(4.7) 



u+q-2 



, j p ) with 



ji = h, 



,Ju-l 



-l,Ju 



i+g-2 

E u 



1. 



Ju+l 



«u-\-q— 1) ■ 



p+q-2- 



We also have to choose some component 1 < v < q — 1 such that 



(4.8) 



u+q-2 

E - 



whenever v > 2. Set J' — (j[, . . . ,j' q ) with 



(4.9) j{ =/„,... ,j' v _! = l u+v - 2 ,j' v = 



u+q-2 



v-2 



li 1 3v+l lu+v- 



•3 jp 



+9-2 • 
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If j u is the joint point of the constructed correlated pair J and J' and j' v is the 
corresponding element in J', then the pair {J, J'} or {J, —J'} is the pre-image of 
L. Note that since when u = v = 1 the conditions (14.61) and (|4.8|) are satisfied, 
the pre-image of L always exists. A simple estimation shows that the number of 
pre-images of L is at most 2pq, not depending on n (we will see this fact plays an 
important role in the estimation of higher moments in section [5]) . 

Since there is at most one element with multiplicity 1 in Sl, if there is one 
number with multiplicity 1, then the number will be determined by others because 
of the balance of L. Consequently, the degree of freedom for such terms is at most 
p+g ~ 2 ~ 1 . Therefore, the sum of these terms will be Ofin 1 ^ 2 ), which can be omitted. 
Now we suppose each number in Si occurs at least two times. Recall the procedure 
in section [31 and we know that the main contribution to the covariance (|4.2[) comes 
from the L € Ti(p + q — 2), which implies K[uj p Lo q ] = o(l) when p + q is odd. When 
p + q is even, for L G Ti(p + q — 2) the weight 

p q p q 

(4.10) E[ajaf] - ~E[aj]E[aj>] = \ J a, \\a, - E[JJ OjjEfQ ay t ] 

s=l t=l s=l t=l 

equals to 1 if j u is not coincident with any component of L; otherwise the weight 
is either E[|a Ju | 4 ] = k or E[|a Ju | 4 ] - (E[\a 3u \ 2 ]) 2 = k - 1. 

So far we have found such terms leading to the main contribution, now we cal- 
culate the variance. Based on whether or not the fourth moment appears, we eval- 
uate the covariance. If the fourth moment doesn't appear, then ji, . . . , j p , j[, . . . t j' 
match in pairs. In the abstract, by their subscripts they can be treated as pair 
partitions of {1, 2, . . . ,p,p+ 1, . . . ,p + q} but with at least one crossing match (i.e., 
^{PtQ) as in section [5]). Thus, for every n € Vi{p,q\ the summation can be a 
Riemann sum and its limit becomes f7(jr) ( it is /j"(7r) when the first coincident 
components in J and J' have the same sign). On the other hand, if the fourth mo- 
ment does appear, then ji, . . . , j p , j[, . . . , j' match in pairs except that there exist a 

p p 

block with four elements. Therefore, from the balance of 3k — and fk = ®i 

k=l k=l 

we know that the main contribution must come from such partitions: ji,. ■ ■ ,j p and 
j[, . . . both form pair partitions; the block with four elements take respectively 
from a pair of ji, . . . , j p and j[, . . . , j' . Otherwise, the degree of freedom decreases 
by at least one. Similarly, for every tt £ ^2,4(^9), the corresponding summation 
can be a Riemann sum and its limit becomes f7j{^) (it is /jj(7t) when the first 
coincident components in J and J' have the same sign). 

In a similar way as in section [3l noting that the coincident components in J and 
J' may have the same or opposite sign, we conclude with the notations in section 
M that 

(4.11) 



as n — > 00. 

This completes the proof. 



□ 
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5. Higher Moments of tr(A^) 

Let 25„ !P denote the set of all balanced vectors J — (ji, . . . , j p ) £ {±1, • ■ • , ±6„} p . 
Let %$ ntPt i (1 < i < n) be a subset of *B njP such that J £ S^p^ if and only if 

t 

Vt£{l,...,p}, l<i + Y,h<n. 
With these notations, Lemma \'3 . 2 1 could be rewritten as 

n 

(5.1) tr(T£) = £ £ oj. 

i=i Jes„,„ j 



To finish the proof of Theorem ll.il it is sufficient to show that given pi,P2>" • ,Pi > 
2 and I > 1, as n — > oo we have 

(5.2) E[lo Pi uj P2 ■ --uj Pi ] — > E[g Pl g P2 ■ ■ ■ g Pl ], 

where {g p } p >2 is a Gaussian family with covariances a p>q = E[g p g q \. 
Then a CLT for 

(5.3) = — (trQ(A n ) - E[trQ(A n )]) 

n 

follows, with the variance 

p p 

(5.4) a 2 Q = ^^g^jff.j. 

The main idea is rather straightforward: in an analogous way to the one used in 
Eq. (|4.2[) . we will deal with 

E[o&---o;£] = 
(5.5) 

.t=i 

Remember that two balanced vectors J = (ji, . . . , j p ) and J' = (j 1; . . . , j' q ) are 
called correlated if the corresponding projections Sj and Sj> of J and J' have at 
least one element in common. 

Definition 5.1. Given a set of balanced vectors { J\, J2, . . . , J;}, a subset of bal- 
anced vectors J mjl ^Jm j2 ,■ ■ ■ ,Jm jt is called a cluster if 

1) for any pair J mi ,J mj from the subset one can find a chain of vectors J ms , also 
belongs to the subsets, which starts with J mi ends with J mj , such that any two 
neighboring vectors are correlated; 

2) the subset J mj ,J mj2 ■ ■ iJm jt cannot be enlarged with the preservation of 1). 

It is clear that the vectors corresponding to different clusters are disjoint. By 
this reason the mathematical expectation in (|5.5j) decomposes into the product of 
mathematical expectations corresponding to different clusters. We shall show that 
the leading contribution to (|5.5j) comes from products where all clusters consist 
exactly of two vectors, as is stated in Lemma l5~2l below. 



■ , , n I 
»1 «i=l /i£»,..,.i J;€9J».»..i. 1=1 
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Lemma 5.2. Provided I > 3, we have 
(5.6) 

, , , n I 

Pl + ---+p;-i 

2 e e n^ E 

»!,...,»,=! JiCSn,., .«, ,...,Ji€!8„,p,. 4 , t=l 



II <« 



aj ( - \&.[aj t \ 

t=i 



o(l) 



where the product FJ* in f\5. 6\) is taken over I vectors which exactly form a cluster. 

For fixed p, all the involved moments no higher than p are 0(1) because of 
uniform boundedness of matrix entries. On the other hand, < H t=1 Ij t < 1- So 
to prove Lemma l5.2l we just need to count the number of terms in (|5.6p . As before, 
to complete the estimation it suffices to replace *B n . p .i by Q3„.p. That is, it suffices 
to prove 

pi+---+pi-i — * 

(5.7) K 2 E 1 = °W 

Ji£<B„, P1 ,...,Ji£<B„, Pl 

where the summation is taken over I vectors which exactly form a cluster. 
Instead of Lemma \57Z[ we will prove 

Lemma 5.3. Provided I > 3, p = (pi, . . . ,pi) with positive integers pi,...,pi > 
2. Let Q5p be a subset of the Cartesian product %$ n ,pi X ••• X %$ n ,pi such that 
( Ji, J2, . . . , J;) G Q3 P i/ and only if 

(i) any element in {j\ =1 Sj i has at least two multiplicities in the union; 

(ii) Ji, J2, . . . , Ji make a cluster; 

fm^OeUL S Jt . 

Then we claim that 

Pi +Po ^ hpl —I 

(5.8) card\® p \ = o(b n * ). 

Notice that we list the condition (iii) which looks redundant from J € {±1, . . . , ±6„} p 
to emphasize the importance of (i.e., the diagonal matrix entry ao). In fact, if 
pi,£>2, ■ ■ ■ ,Pi are all even, even for these J £ {0, ±1, . . . , ±6„} p but under the con- 
ditions (i) and (ii), we can still get the above estimation. 

Proof of Lemma \5.S\ The intuitive idea of the proof is as follows: from condition (i) 
in Lemma l5~3"l regardless of the correlating condition, the cardinality of Q3 P (denoted 

P1+P2H hgj 

by card|Q3 p | for short) is 0(b n 2 ). We could say that the freedom degree is 
pi+P2+---+pi _ g u t eacn correlation means two vectors share a common element so 
that it will decrease the freedom degree by one. To form a cluster we need I — 1 
correlations. So card|Q3 p | = Q(b^ 2 ~ 2 ~~ {L ~ iy ). When I > 3, l-l > |. Thus we 



obtain the desired estimation in Lemma 15.31 However, unfortunately adding one 
correlation does not necessarily lead the freedom degree to decrease by one. Some 
may be redundant. So we have to make use of the correlations more efficiently 

As in section SI we do the reduction steps as long as the structure of the cluster 
is preserved. To say precisely, we start from checking Ji and J2 . If j u is the first 
joint point of J\ and J2 satisfying the condition that JiVlj 1^2 can still form a 
cluster with the other vectors, then we do this reduction step. If this kind of j u 
does not exist, we turn to check J\ and J3 in the same way. After each reduction 
step, we have a new cluster of vectors Ji, J2, . . . , Jr. We continue to check J\ and 
Ji as before. If we cannot do any reduction step, we stop. 



FLUCTUATION OF EIGENVALUES FOR RANDOM TOEPLITZ MATRICES 



15 



Suppose that we did m reduction steps in total. Then we have a new cluster of 
vectors J{, J 2 , . . . , J[, and the dimension of J[ is for 1 < i < I'. From the reduction 
process, for any 9 e Sj^ f)Sj> a , J' ai \l ' gJ'a 2 cannot form a cluster with the other 
vectors. The resulting cluster still satisfies condition (ii) in Lemma \b. 31 However, 
the condition (i) may fail because the joint point of a pair of correlated vectors could 
be triple multiplicity, thus after a reduction step its multiplicity becomes one. 

Note that after a reduction step the number of pre- images of the resulting vector 
only depends on the dimensions of the involving vectors, not depending on n. Thus 
we only need to estimate the degree of freedom of the reduced vectors J[, J 2 , . . . , J\,. 

Since after one reduction step the total dimension of vectors will decrease by two 
and the number of vectors will decrease by one, we have 

i 

(5.9) Y,Pi = Y,Pi + 2m 

i=l i=l 

and 

(5.10) 1 = 1' +771. 

Denote by Iq the number of single multiplicity elements in [L=i &J'- ■ Since one 
reduction step will add at most one element with single multiplicity, therefore 

(5.11) lo<m. 

Below, we will proceed according to two cases: V > 1 and V = 1. 

In the case V > 1 

To complete the proof of this case, we need some definitions and notations. 

Let U be the set consisting of all elements which belongs to at least two of 
Sj ; , . . . , Sj> v , i.e. U = {9\3i ^ j s.t. 9 € Sj> f| Sj>}. Since I' > 1 and J' x , . . . , J[, 

forms a cluster, we get U + 1 . For any 9 £ (Ji=i set He ='■ {Jj\ 9 or — 9 e 
J[, 1 < i < I'} and denote the number of vectors in He by he = card|i/e|. 

Obviously, {Jl\i<l'} = \J e& u H e- 
We notice the following three facts. 

Fact 1: For any 9 G U, h$ > 3. In fact, from the definition of U, we know 
hg > 2. If h$ = 2, the two vectors in He could still be reduced to one vector 
and the reduction doesn't affect their connection with the other vectors, which is a 
contradiction with our assumption that J[, J' 2 , . . . , J[, cannot be reduced. 

Fact 2: For any 9 and J[ £ Hg, the multiplicity of 9 in Sj' is one . Otherwise, 
there are two vectors belonging to Hg, for example, J[, J' 2 € Hg but Sjj has two 
#'s, then J\\J eJ-2 could be a reduction step and J[\/gJ 2 , J3, • • • , J!> still forms a 
cluster. 

Fact 3: For any different elements 9 and 7 in U<=i'^'^! card|i?0 p|if 7 | < 1. 
Otherwise, suppose J[ and Jj belongs to He p| H^ and J' k is an element of Hg other 
than J[ and Jj. From Fact 1, J' k must exist. Now J^\J g J' k can form a reduction 
step since J^\J ' g J' k could be correlated with Jj by 7 and other correlations won't be 
broken. 

Definition 5.4. V C U is called a dominating set of { J[, . . . , J[, } if {Jj'|l < i < 
l '} = {Jeev H 8- 
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Choose a minimal dominating set denoted by Uo, which means that any proper 
subset of Uq is not a dominating set. Since hi is a finite set, Uo must exist and 
write Uo = {9i\l < i < £}. For any 1 < j ' < f, since Wo\{0j} is not a dominating 
set , there exists G ■He j \Ui/j Once we have already known the elements 

of Ui=i Sj'\Uq, we know all the elements in Sj' other than thus 6*,- will be 

determined by the balance of J' k . . 
Set 

i=l 

,/ gl=i p 'i-'a- h I ; 

then the different way of choice of U»=i Sj'\Uq is 0(6 n 2 ). From Eqs. 

(|0]) and ([5TT0]) . we have 

^ 512 ^ ELi p[-k-h | ^ _ Pi - Q - (m - Z ) - (ft - I') 

Note that m - Z > from Eq. (|5.11l) . If ft > I', we have 

(SUl Pj-i)-(m-i )-(h-i') S' = l"»-' 

(5.13) 0(6„ 2 ) = o(6„ 2 ). 

If h — V and i = 1, the analysis is easy but a little complex. We will deal with it 
later. 

Now we focus on the situation that ft = V and t > 1. In this case, since {J,'|i < 
Z'} = \J eeUo Hg, Z' < Ei=i cardjHgJ = ft and the equity is true iff Hg i f] Hg j = 
for any 1 < i ^ j < t. Because J[, J' 2 , ■ ■ . , J{, form a cluster, without lost of 
generality, we could assume that {{Jj, eHe Sj>) n(Uj'eff 9 Sj>) ^ 0- Thus there 
exist J' S1 € -ff^j ,J' S2 € i?e 2 and 7 € Sj^ |~| Sj< . We know that 7 e U\Uq since 
H e z C\ H e 3 = for any 1 < i ^ j < t. 'From Fact 3, card|iJ 9s f| H 7 \ = or 1. 

Given the elements of (Ji=i Sj'\(hio U{7})i @i can be decided by the the balance 
of some J' € Hg^H-y. Then 7 can be decided by any vector in H y . Thus we have 

0(6„ - °) ways to decide J[, J' 2 , . . . , J'i,. From Eq. (|5.12[) . one gets 

gjll P^-'Q-fc-'iT 1 I (gj=l Pi-0-(m-i )-(h-i')-h-, E' = i Pi"' 

(5.14) 0(6„ 2 + '°)-0(&n 2 ) = 0(b n 2 )■ 



If ft = V and t = 1, which means that 9\ appears exactly one time in each 
<Sj;(l < * < Z'), we will divide the situation into three subcases. 
Case I: Z > 0. 

Without loss of generality, we suppose a € J[ is an element with single multiplicity 

in (Ji=i ^i- If ^ ne elements except for #1 and a are known, #1 could be determined 
from the balance of J' 2 and then a could be determined from the balance of J[ . So 
we have 

g'=i p'j- l o- h e 1 , 

o(&; 2 +0 ) 

ways of choice in sum. From hg 1 = ft = Z' and Eqs. (|5.9[) and (|5.10[) . we get 
0(&n 5 + ° 1 ) = 0(6^ 1 ) = (&— ). 
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Case II: Iq = and m > 0. 

As case I above, B\ is determined by other elements. To determine all the elements 
other than 6\, we have 0(b n 2 ) ways. From Eqs. (|5.9j) and (|5.10|) . we have 

£ p'j-l' Sn-i m £' = 1 Pi- 1 

0(b n 2 ) = 0(bn 2 2 ) = 0(b n 2 ). 

Case III: Iq = and m = 0. 

In this case, we cannot do any reduction step. 

We claim that there exists 7 g other than 7 7^ #1 such that the number of 
+7 and — 7 in 7 G S,j 1 are not equal (when p\ is even we can always find some 7 
other than 61 such that 7 occurs odd times in Sj-J. Otherwise, #1 = because of 
the balance of J\, which contradicts condition (iii) in Lemma 15.31 

To determine all the elements except for 9\ and 7, we have 

(Pl + ... + Pi)-(h 9l +1) 

0(b~ 2 ) 

ways. Since Hg 1 = {Ji, J2, . . . , J;}, iT 7 —ca,Td\He 1 H^rl = 1j which means if 7 = 
{Ji}. So we can determine #1 from the balance of Ji. Then 7 will be determined 
by the balance of J\. Since hg 1 = I, we have 

(Pi+...+ Pi )-(h 9l +i) ( P1+ ... +Pi )_i 
0(6„ 2 ) = o(6„ 2 ). 

Now we have completed the proof in the case of I' > 1. 

In the case £' = 1 

We also divide this case into two subcases. 
Case I': Z > 0. 

Suppose a G J[ is an element with single multiplicity in Ui=i ^ ^he elements 
other than a are known, a could be determined from the balance of J[ . So we have 

totally 0{b n 2 + ) ways. From V = 1 and Eqs. (|BT9]> . (^TOl) and fSTTT]) . we 
have 

0(&„ 2 +Zo 1 ) = 0(6„ 2 2 ) = o(6„ 2 ). 

Case II': l Q = 0. 

In this case, every element in S'j' appears at least twice. Thus we have totally 
0(6 n 2 ) ways. From I' = 1 and Eqs. ([S3]) and (|5.10l) . it follows that 

T 1' = 1 Pj gLlPj-j i-2 £j = i Pj~' 

0(6„ 2 ) = 0(b n 2 2 ) = o(b n 2 ) 

since I > 3. 

Now we have completed the proof in the case of I' = 1 . 

The proof of Lemma IS~3l is then complete. □ 

Remark 5.5. From the analysis of case III above, we could also understand why 
the technical but necessary condition of ciq = is assumed in the introduction. But 
when p is even, the assumption of ao = is not necessary. 
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Remark 5.6. From the calculation of mathematical expectation, variance, and 
higher moments in sections HI the factor Ij in the trace formula of 13. 21 can be 
replaced by a more general non-negative integrable bounded function. For example, 
if I j = 1 for all J, then the results of asymptotic distribution and fluctuation hold 
the same as in case where 6 = 0. More precisely, as n — > oo 

1 MlT U /25(p-l)!!, pis even; 

(5 - 15) ^ h E[ }\ ai > ]S o,t*-*\0, Pis odd 

°" Ji,---,3 P =-K 1=1 i=i K ' y 

and the fluctuation can be stated as in Corollarv l6.5l (For conciseness we replace b n 
by n there). From the above point of view and the extended results, our method 
gives a mechanism of producing a CLT. 



6. Extensions to other models 

In this section, we will make use of our method to deal with some random ma- 
trix models closely related to Toeplitz matrices, including Hermitian Toeplitz band 
matrices, Hankel band matrices, sparse Toeplitz and Hankel matrices, generalized 
Wishart-type Toeplitz matrices, etc. Besides, we will also consider the case of sev- 
eral random matrices in a flavor of free probability theory. Before we start our 
extensions, we first review and generalize the key procedures or arguments in cal- 
culating mathematical expectation (the distribution of eigenvalues) , covariance and 
higher moments in sections El IH [5] respectively as follows: 

1) "Good" trace formulae. See Lemma \'S. 2 1 The simple formula both represents 
the form of the matrix and translates our object of matrix entries to its subscripts 
ji, . . . , j p , which are integers satisfying some homogeneous equation. In the present 
paper, we mainly encounter such homogeneous equations as 

p 

(6.1) 5>j ; = 0, 

i=i 

where t\ can take ±1. We write J = (Ji, . . . ,j p ). 

2) "Balanced" vectors via some homogeneous equations. We can generalize the 
concept of balance: a vector J = (Ji, . ■ ■ ,j p ) is said to be balanced if its components 
satisfy one of finite fixed equations with the form of (|6.ip . From the balance of a 
vector, one could determine an element by knowing the other ones and solve one of 
the equations. 

3) Reduction via a joint point. Eliminate the joint point from two correlated 
vectors in some definite way, and one gets a new balanced vector. 

4) Choose a minimal dominating set by which the freedom degree can be reduced 
case-by-case. We should particularly be careful about 0. 

The CLT is essentially a consequence of the fact that we can omit the terms who 
have a cluster consisting of more than two vectors. Thus if the argument in section 
[5] is valid, the CLT is true. In the following models, we will establish a good trace 
formulae in each case and then balanced vector and reduction step can be defined 
in a natural way. With these equipment the analysis in section [5] is still valid after 
a small adaption. 

The mathematical expectation and covariance vary from case to case. But the 
way to calculate them is similar with each other. We will define some suitable 
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integrals associated with pair partitions similar to those in section [2j after which 
we can state the results concisely. 

We will only state main results but not give their proofs in detail, because the 
proofs would have been overloaded with unnecessary notations and minor differ- 
ences. When necessary, we might point out some differences of proofs. 

6.1. Hermitian Toeplitz band matrices. The case is very similar to real sym- 
metric Toeplitz band matrices, except that we now consider n-dimensional com- 
plex Hermitian matrices T n — (ryy ai-j)™ We assume that Reaj = Rea_j and 
Imoj = — Ima_j for j = 1, 2, • • • , and {Reaj, Imaj}j 6 N is a sequence of indepen- 
dent real random variables such that 

(6.2) E[a 3 ]=0, E[|a 3 | 2 ] = l and E[a|] = for j 6 N, 
(homogeneity of 4-th moments) 

(6.3) K = E[|a/], 
and further (uniform boundedness) 

(6.4) supE[|aj| fe ] = C k < oo for fceN. 

In addition, we also assume ao = and the bandwidth b n — > oo but b n /n — > b € 
[0,1]. 

Theorem 6.1. With above assumptions and notations, Theorem \l.l\ also holds for 
random Hermitian Toeplitz band matrices. 

Remark 6.2. The distribution of eigenvalues for this case has been proved to be the 
same as real case in [25]. The covariance like in (|4.2I) is slightly different from real 
case because E[a 2 ] = 0, which is given by 

(6-5) Y, //"W + («"!) E (/z"/W + //>«)• 

6.2. Hankel band matrices. A Hankel matrix H n = (frj+j-i)"j = i is closely re- 
lated to a Toeplitz matrix. Explicitly, let P n = (<5j-i,Ti-j)"j=i the "backward 
identity" permutation, then for a Toeplitz matrix of the form T n = [ai-j)"j =1 and 
a Hankel matrix of the form H n = (hi+j—i)f P n T n is a Hankel matrix and 
P n H n is a Toeplitz matrix. In this paper we always write a Hankel band matrix 
H n = P n T n where T n — (rjij o>i—j)fj =1 is a Toeplitz band matrix with bandwidth 
b n and the matrix entries a_ n +i, • • • , ao, • • • , a n -i are real-valued, thus H n is a real 
symmetric matrix. 

For Hankel band matrices, as in Toeplitz case we also have a trace formula and 
its derivation is similar, see [26] . 

Lemma 6.3. tr(If£) = 

' n b„ p p 

e e n a k n x[i,n](* 

*=1 3U— >jp = -bn '=1 ' = 1 
< n 6„ P P 

e e n a j, n X[i,n](* 

i=l ,j p =— b n 1=1 1=1 



E (-!)%) S p , p even; 

q=l 0,E(-lh? 

9 = 1 

E(-i)%H a, , . vodd. 
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From the above trace formula, our method can apply to the case that p is even. 
We consider random Hankel matrices satisfying the following assumptions: assum- 
ing that {aj : j 6 Z} is a sequence of independent real random variables such 
that 



(6.6) E[aj] = 0, E[|aj| 2 ] = 1 for j G Z, 
(homogeneity of 4-th moments) 

(6.7) K = E[| aj | 4 ],j eZ 
and further (uniform boundedness) 

(6.8) supE[\a 3 \ k ] = C k < oo for k > 3. 

In addition, we also assume the bandwidth b n — > oo but b n /n — > b £ [0, 1] as 
n — > oo. 

Theorem 6.4. Let H n be a real symmetric ( 16. 6\) - ![6.8\) ) random Hankel band 
matrix with the bandwidth b n , where b n /n — > b € [0, 1] but b n — > oo as n oo. Set 
A n = H n I \Jlh l and 

(6.9) C P = ^(tr(^)-E[tr(^)]). 
Then 

(6.10) Cp ^iv(o,a2) 

in distribution as n — » oo. Moreover, for a given polynomial 

v 

(6.11) Q{x) = Y J W 3 

3=0 

with degree p > 1, sei 

/5T 



(6.12) cq = (trQ(A ' } - E [ tr °^)0 ' 

we also have 

(6.13) Cg— >W(0,5$) 

m distribution as n — » oo. Tiere £/ie variances and Oq will be given below. 

We remark that the limit of ^tr(i/, i /v / 5^) p can be calculated in the same way 
as in Toeplitz case, see also [26] . Next, we derive briefly the variances cr 2 and Oq. 



By Lemma 16.31 rewrite 



(6.14) C P = ^T^2^2lj(aj-naj}) 5 2p 

nb— U J 

Here J = (ji,...,h P ) G {-&n, • • ■ , K} 2p , aj = H^a^, Ij = Ilfc=i X[i,n] {i ~ 
Yli=i(~^) l jl) an d the summation J^j runs over au possibile J € {— &„, . . . , b n } 2p . 
We call a vector J = (ii, . . . , J2 P ) € {— b n , . . . , 6„} 2p is balanced if 

2p 

(6.15) £(-1)'* = °- 
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Since cij and a_j are independent when j ^ 0, Sj, the projection of J, should be 
J itself (forget the order of its components) . 

The only difference is the definition of balance and projection. Using the same 
technique in section [4j we get 

(6.16) E[C P C,]— >* M = E 9i M + («-!) E 

^£^2(2^,29) 7rG-p 2 ,4(2p,2(3) 

as OO. 

Here for a fixed pair partition n, we construct a projective relation between two 
groups of unknowns yi, . . . , y2p+2q and xi, . . . , x p + 9 as follows: 

(6-17) m = yj = 

whenever i ~ w j. 

Ific€P 2 (2p,2q), let 

(6.18) 

9/W = / W E( -1 )^ X r 2 P+2g 1 

J[o,i] 2 x[-i,i]"+« / j (= i: (-i)**i=o > 

2p j 2p+2q j' p+q 

~[[x[o,i]( x o - b^2(-iyyi) Yi X[o,x\(y°~ b E (- l Yyi) d yoY[dxi. 

j=l i=l j'=2p+l i=2p+l i=0 



If 7T e 7\ 4 (2p,2g), let 
(6.19) 

= / X( 2 P )Xi .., 

J[o,i] 2 x[-i4] p +''- 1 i E(-i)*J/i=o> < E (-i)*w=o 



2p J 2p+2q / P+9-1 

n x[oi]N -6E( _i ) 4 ^) n x[o,i](yo-b e (-i)Vi)^yo n da;i - 

j=l i=l j'=2p+l i=2p+l i=0 

Because of the existence of the characteristic function in the integrals of type I 

2p+2q 2p 

and II, we see that gi(n) ^ if and only if {~^-) l Vi = when ^ (— l) l yi = 0. 

i=2p+l i=l 

Denote the subset of V2(2p, 2q) consisting of this kind of ir by "P^ (2p, 2q). 

2p+2q 2p 

9i i (n) 7^ if and only if ( — ^Tl/i = XX - ^-YVi = 0- Denote the subset 

i=2p+l i=l 

of 7^2,4 (2p, 2q) consisting of this kind of tt by V\\{2p,2q). From the definition of 
balance, if we denote Vj = {ii,*2j*3j*4} to be the block with four elements and 
1 < i\ < %2 < 2p < «3 < 14 < 2q, then ii + «2 and 13 + 24 must be odd. Moreover, 
j + k is odd provided j, k £ Vi and i ^ j. 
From the discussion above, we get 

(6.20) E[&Cg]— ►5i,,= E 5iW + («-!) E 

7i-G7 , 2 r (2p,2 g ) 7rG'P^ J 4 (2p,2 g ) 



3.21) 
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and 



V P 
2 _ \ ^ \ 

i,3' 



(6.22) *Q=l^2^<liW 

i=l j=l 

Similar to Corollary 10.51 we get another central limit theorem for product of 
independent random variables. 

Corollary 6.5. Suppose that {a,j : j G Z} is a sequence of independent random 
variables satisfying the assumptions (|6.6p - (|6.8p . For every p > 1, 

(6-23) -^~r >; ! IK I «„] I i: 




o,£(-i) J jj 
i=i 

converges in distribution to a Gaussian distribution N(0, cr^). Here the variance cr^ 
as above corresponds to the case b = 0. 

Remark 6.6. The derivation of higher moments can be calculated in the same way 
as in section[S] Careful examination shows that the argument in case III in section[5] 
(the existence of 7) is the only part depending on the definition of balance. However, 
since 2p and 2q are even, the dimensions of all vectors involved are even. Remark 
15.51 still applies so that one needn't care much about the definition of balance. 
Moreover, ag needn't equal to 0. The same thing happens in section [6T4l 



6.3. Sparse Toeplitz and Hankel matrices. A sparse matrix is a matrix in 
which some entries are replaced by 0, which occurs in some application background. 
A sparse random matrix provides a more natural and relevant description of the 
complex system in nuclei physics, we refer to [4] for an introduction. 
A sparse Toeplitz matrix can be expressed as follows. Let 

Bn= ^U^'^j O' 1 '---'"- 1 ) 



we define 

(6.24) ^ = 



€ B n ; 
otherwise. 



Then a sparse Toeplitz matrix is defined by 
(6-25) r n =(^a i _ j )^ =1 , 

and the corresponding Hankel matrix is H n = P n T n as before. Note that for Hankel 
matrices we can define a more general case with minor adaptations: 

B n = ]M[b^- 1 \b^}\ f|{0,±l,...,±(n-l)} 



and 
(6.26) 



I, i-je B n 
0, otherwise. 
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In addition, we assume that there exists a sequence {6 n }with b n — > oo but b n /n — > 
be [0, 1] such that -f > as n — > oo. Set 

r 

(6.27) B+ = (J [J/ 2 ^ , M 2j )] C [0, 1] 

With the above assumptions, we claim: 

Theorem 6.7. Under the same conditions therein, Theorems l 1 . 1\ and \6.1\ also hold 
for sparse random Toeplitz matrices, and Theorem \6.J\ also holds for sparse random 
Hankel matrices. 

We remark that in sparse matrix case, the only difference is integral interval of 
variables, for example, the moments in Eq. (|3.1[) become 

.2k j k 

(6.28) M 2k = / n^lWlt 10 + b^2e^(i)x^ (2) )Y[dxi, 

«ev 2 (2k) J ^ xBk 3=1 i=l 1=0 

where B = B+ [j B-, while B- := {x\ - x E B+). 

6.4. Singular values of powers of Toeplitz matrices. For a real or complex 
(random) Toeplitz band matrix T n — (jjij ai_j)™j =1 with the bandwidth b n , writing 
T* for the adjoint matrix of T n , we consider the product T* S T£ of powers with some 
s G {1,2,...,}. When the Toeplitz T n is replaced by a n x n matrix, the norm of 
product of random matrices in the above form was studied in [5] , when investigating 
the limiting behavior of solutions to large systems of linear equations. Recently, 
the limiting distribution of the product of random matrices has been studied in [1] 
and its moments is known as Fuss-Catalan numbers. 
Set 

(6.29) WW = It^T*. 

For real Toeplitz case, we assume that {aj : j € Z} is a sequence of independent 
real random variables satisfying f (|6.6p - (|6.8|) ). For complex Toeplitz case, we assume 
that {Reaj, Imaj}j 6 z is a sequence of independent real random variables such that 

(6.30) E[aj] = 0, E[|aj| 2 ] = 1 and E[a 2 ] = for j E Z, 
(homogeneity of 4-th moments) 

and further (uniform boundedness) 



3.31) k = E[\c 141 



(6.32) supE[| aj r] = C k < oo for k EE. 

In addition, we also assume the bandwidth b n — > oo but b n /n — > b G [0, 1] as 
n — » oo. Note that we have the trace formulas as in Eq. (|3.2[) (more like the case 
of even p in Eq. (|6.6p ) 

1 n 

(6.33) tr ((W^r) = Y.Y.^^ 5 ri-i, ' P^ n - 

v y 6 " m T o,E(-i) [ — lo ii 

1-1 J i=i 
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Here J = (ji,.. . ,j 2ps ) & {-&„, . . . , M 2ps , ^ = IIfc=i + Eli ("l)^ 1 ^) 

and a,/ = nf=i %r No denotes the largest integer not larger than x, and 



(6.34) 



_ _ f Oj,, if [ l -f-]o is even; 
1 , otherwise. 

On the other hand, we remark that when s = 1 the real Toeplitz case reads 
(6.35) tr((WWr)=^tr(fl^), 

where H n denotes the Hankel matrix as in Theorem 16.41 Analogous to Theorem 
16.41 we have the following theorem for real and complex Toeplitz matrices: 



Theorem 6.8. Under the above assumptions, for a given polynomial 

v 

(6.36) Q{x)=^q j x : > 

3=0 

with degree p > 1, then 
n 

in distribution as n — > oo. Here er?, denotes the variance. 



(6.37) ^ (tvQ(W^) E[trQ(W^)}) — ► # (0, ^) 



Remark 6.9. Because all the involved vectors have even dimensions, according to 
Remark 16.61 the assumption of ao = is not necessary. 

We can evaluate the variance Oq as in the Hankel case, but its expression is very 
redundant so we omit it. Here we only gives the limit of p-moment, i.e., 



~tr(Vi 8) ) P 



E 
(6.38) 

2ps j ps 

Here for a given partition 7r S Vi(2ps\ the unknowns {yi, . . . , j/2ps} and ■ • ■ , x ps } 
satisfies the relation: 

(6.39) yi = y d = 
whenever i ~ w j. 

Remark 6.10. The non-zero terms in the summation of Eq. (|6.38p come from such 
partitions as 

(6.40) (-1)^1° + (-1)^° = 

whenever i ~ n j. We can treat the desired pair partitions as follows: considering 
2p alternative groups 



each of which consists of s l's or s (-l)'s. Then the pair partitions satisfying (|6.40|) 
correspond to the pair matches of these 2ps 1 or -1 such that each pair has one 1 and 
one -1. The total number of this kind of partitions is (ps)!. Thus, in the special case 
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6 = 0, the p-th. moment in (|6.38|) equals 2 ps (ps)\. By the property of T- function, one 
knows that 2 ps (ps)\ is the p-th. moment of the density f(x) — j^x~^~ exp(— hxi) 
on [0,oo). However, unfortunately when s > 1 the density fix) is not uniquely 
determined by its moments, see |16j . Thus we cannot say that the distribution of 
eigenvalues of Wn^ converges weakly to f(x). 

6.5. Product of several Toeplitz matrices. In free probability theory, one usu- 
ally considers the limit of joint moments of several independent matrices [Ti?l 128] . 
Based on this point of view, the first version [25] of [55] by two of the authors, 
provided the limit joint distribution of several independent Toeplitz and Hankel 
matrices. In the present paper, we will prove Gaussian fluctuation of the joint mo- 
ments for symmetric Toeplitz matrices (the same result proves right for Hermitian 
Toeplitz case and Hankel matrices ). More precisely, given independent symmetric 
Toeplitz band matrices Ti tTl , T-i^m ■ ■ ■ , T r>n , we study the limit and fluctuation of 

for 1 < ii,i2,. .., ip < r. 

First of all, we introduce some notations. For given 1 < i\, ■ ■ ■ ,i p , i p +\, • • • , i P + q < 
r, there exists an associated partition tt — {V\, ■ ■ ■ , V s } of \p + q] such that 

(6.41) j ~„. k <^=> ij =i k . 

Let V^ip, q) denote a subset of Vi(j>, q), which consists of such pair partitions tt: 

(6.42) j ^ k => j ~ 7ro k. 

When q = 0, we denote V^ip, 0) by V^ip)- Similarly, V% 4(^,5) denotes a subset of 
^2,4 (Pi q)i which consists of such partitions tt: 

(6.43) j ^ k=^ j ~ 7ro k. 

Theorem 6.11. Let Ti, n , T2 ; „, . . . , T rj „ be independent real symmetric random 
Toeplitz band matrices, each of which satisfies the assumptions in Theorem 
with the above notations, we have 



p ^[tr(Tj 1 ; n-^2,n ' ' ' -^i p ,n)] ^ 

nbn 

P P/2 

£ /[oiixf-iilp/2 II X[o,i](^o + 6ELi^(*)^w) II da; ^ evenp; 

7rS-P 2 °(p) ' j=i 1=0 

0, odd p 

as n — >■ 00. Now, set 

(6.44) _ 

S n {i\,i2, ...,i P ) = (tr(Ti n Tj 2 „ • • • 1* „) - EftrfT^ „T i2 „ • • • Ti „)]) . 

n 7,2 

On 

Then for p > 2 the family {S n (ii, . . . , i p ) : 1 < i\, ■ ■ ■ , i p < r} converges in 
distribution to a Gaussian family. For fixed p, q > 2, we /iaue 

E^n^i,^, ■ ■ - ,ip) S n (i p+ i,ip+2, ■ ■ ■ , ip+q)] ► 

(6.45) (//"W + /z + W) + («"!) E + 

ir£V°(p,q) neV°.(p,q) 
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when p + q is even and 

(6.46) E[S n (ii,i 2 , ■ ■ ■ ,i p ) S n (ip + i,i p+2 , ■ . ■ ,i p+q )} = o(l) 

w/ien p + q is odd as n — > oo . 
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